Bytedance
-
Input tokens/M
Output tokens/M
Context Length
MerantixMomentum
Compressible version of Qwen2.5-3B provided by the ACIP project, supporting dynamic model size adjustment while maintaining performance